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ABSTRACT 

Using the Sloan Digital Sky Survey, Data Release 5 (SDSS-DR5), we survey proximate damped Lya 
systems (PDLAs): absorption line systems with H I column density 2Vhi > 2 x 10 20 cm -2 at velocity 
separation Sv < 3000 km s" 1 from their background quasar. These absorbers are physically associated 
with their background quasars, and their statistics allow us to study quasar environments out to z ~ 5. 
However, the large ionizing flux emitted by a quasar can ionize the neutral gas in a nearby galaxy possibly 
giving rise to a "proximity effect", analogous to the similar effect observed in the Lya forest. From a 
sample of 111, we measure the H I frequency distribution f{Nsx,X), incidence, and gas mass density 
of the PDLAs near luminous quasars over the redshift interval z = 2.2 to 5. The incidence and mass 
density of PDLAs at z ~ 3 is approximately twice that of intervening DLAs, but at z < 2.5 and z > 3.5 
the f(Njn,X) distribution is consistent with the intervening population. We interpret the observed 
enhancement of PDLAs around quasars in terms of quasar-galaxy clustering, and compare the strength of 
the clustering signal to the expectation from independent measures of the respective clustering strengths 
of DLAs and quasars, as well as a complementary analysis of the clustering of absorbers around quasars 
in the transverse direction. We find that there are a factor of 5 — 10 fewer PDLAs around quasars than 
expected and interpret this result as evidence for the hypothesis that the ionizing flux from the quasars 
photocvaporates H I in nearby DLA galaxies, thus reducing their cross-section for DLA absorption. This 
constitues the first detection of a "proximity effect" for DLAs. 
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1. INTRODUCTION 

Although quasars are believed to reside at the centers 
of massive galaxies which are themselves located in groups 
or clusters, their spectra rarely exhibit the absorption sig- 
nature of either their host's neutral interstellar medium 
(ISM) or the ISMs of associated nearby galaxie^. Nearly 
every galaxy at z » 3 should exhibit an ISM comprised of 
neutral hydrogen. If the quasar sightline penetrates this 
ISM, it will absorb light along the hydrogen Lyman series, 
with strongest absorption at Lya: A rcst = 1215. 67A. Sys- 
tems with H I column densities A/hi exceeding 2 x 10 20 cm" 
are termed damped Lya (DLA) syste ms owing to the strong 
damp : 

2005, for a review) . Gas clouds at these high column den- 
sities, characteristic of a galactic disk, are optically thick 
to Lyman continuum (tll ^ 1) photons, giving rise to a 
neutral interior self-shielded from the extragalactic ioniz- 
ing background. 

One might guess that the absence of DLAs near most 
quasars is related to the large ionizing flux emitted by the 
quasar. In particular, for a quasar at z = 3 with an i-band 

5 The relatively few cases where one observes significant ab- 
sorption at 2 fa z em are characterized by very broad features 
and are termed Broad Absorption Line (BAL) quasars. These 
BAL quasars are typically attributed to gas with extreme densi- 
ties and lar ge velocity fields residing in the broad line region of the 
quasar fe.e.lWevmann et al. 111 991; Tole a, Krolik fc Tsvctanov| [2u0l; 
IReichard et al. 1 120031 ; ITrump et al. 1 120061) and should not be con- 
fused with the much larger galactic scale absorption which causes 
DLAs. 



magnitude of i = 19.1, the flux of ionizing photons is 400 
times higher than that of the extragalactic UV background 
at a comoving distance of 1 h" 1 Mpc (corresponding to 
Hubble flow velocity of 100 km s _1 ), and increasing as r~ 2 
toward the quasar. Indeed, the decrease in the number of 
optically thin absorption lines (log A/hi < 17.2 hence tll < 
1), in the vicinity of quasars, know n as the proximity effect 
(jBaitlik. Duncan fc Ostrikerlll988l), is well studied and has 
been detected ([Scott et al. I |2000T iFaucher-Giguere et al. I 
[200l . 

On the other hand, it has long been known that quasars 
are associated with enhancements in the distribution of 



Gunnlll969tlYee fc Greenl l984 
t^87tlBahcall fc Chokshill99ltlSmith. Boyle fc Maddoxl200C 
i Brown. Boyle fc WebsterfeOOlHSerber et al. l2006tlCoil al.\ 
|2007| ) , although these measurements of quasar galaxy clus- 
tering are mostly limi t ed to low redshifts < 1.0. Recently, 
lAdelberger fc Steidell (|2005f ), measured the clustering of 
Lyman Break Galaxies (LBGs) around luminous quasars 
in the redshift range (2 < z < 3.5), and found a best 
fit correlation length of ro = 4.7 h 1 Mpc (7 = 1.6), 
very si milar to the auto-correlation length of z ~ 2 — 3 
LBGs (lAdelberger et al. l[2lM[2005n . iCooke et al. 1 (pool 
recently measured the clustering of LBGs around DLAs 
and measured a best fit ro = 2.9 h" 1 Mpc with 7 = 
1.6, but with large uncertainties (see also iGawiser et al. I 
I200H iBouche fc Lowenthal|[200l . If LBGs are clustered 
around quasars, and DLAs, might we expect DLAs to 
be clustered around quasars? This is especially plausi- 
ble in light of recent evidence that DLAs arise from a 
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high redshift galaxy population which are not unlike LBGs 
(|Moller et aL 112002ft . 

Can DLAs continue to self-shield against the large flux 
of ionizing photons emitted by quasars or are they be pho- 
toevaporated? Is the distribution of DLAs around quasars 
dominated by ionization effects or does galaxy clustering 
around the quasars dominate? Does the column density 
distribution of DLAs near quasars differ from the distribu- 
tion in average places in the Universe? What can quasar- 
DLA clustering teach us about quasars and the physical 
nature of high redshift galaxies? 

In this paper, we take positive step towards answering 
these questions by measuring the abu ndance and distribu- 
tion o f DLAs near quasars. Recently, (|Prochaska. Herbert-Fort 
120051 ; hereafter PHW05) surveyed the quasar spectra 
of the Sloan Digital Sk y Survey (SDSS) Data Release 3 
(|Abazaiian et ani2005f ) for damped Lya systems at z > 
2.2. The statistical survey, comprising over 500 DLA sys- 
tems, provides a relatively precise measure of the incidence 
of intervening DLA systems per unit redshift, i(z), from 
z = 2.2 to 4. To focus on the cosmic average of £(z)and 
avoid biases from the quasar environment, PHW05 ex- 
cluded pjTOMnateTJLAs (PDLAs) from their analysis, which 
arc defined to be DLAs with velocity offset Sv < 3000 km s _1 
from the emission redshift of the background quasar^. 
Here we present the results of a survey for proximate damped 
Lya (PDLA) systems in the fu ll SDSS Data Release 5 
(DR5 lAdelman-McCarthv et al. |[2006ft . 

The first st a tistic al study of PDLAs was performed by 
Ell ison et aT\ (|2002ft who surveyed their sample of radio- 
selected QSOs. Our work is motivated by this and several 
recent, closely related studies. First, Russell et al. (2006; 
hereafter REB06) have previously performed a search of a 
subset of the SDSS-DR3 quasar spectra for PDLAs. They 
reported an enhancement of £(z) for DLA systems with 
Sv < 3000 km s" 1 and, oddly, an enhancement of PD- 
LAs with 3000 km s" 1 < Sv < 6000 km s^ 1 , correspond- 
ing to very large distances (30 — 60 hr 1 Mpc) from the 
quasar if interpreted as Hubble flow. Our search builds on 
their work, but we survey the larger SDSS-DR5 to recover 
a final sample with nea rly an order of magn itude more 
PDLA systems. Second. [Hennawi et ~aT\ <|2P06) published 
a large sample of optically thick absorption line systems 
in the vicinity of z ~ 2.5 quasars, which were identified 
using a background line-of-sight in close projected quasar 
pair sy stems. Based on this sample. iHennawi fc Prochaskal 
(200(f) measured the cross-correlation function between 
foreground quasars at z ~ 2.5 and optically thick Njjj > 
10 19 cm 2 absorbers, detected in the background quasar 
spectra. Their measurement of the transverse cluster- 
ing of absorbers around quasars should be commensurate 
with incidence of similar systems along the line-of-sight, 
provided that the clustering pattern of absorbers around 
quasars is isotropic. However . IHennawi fe Prochaskal {2006) 
argued that the strength of the transverse clustering pre- 
dicts that ~ 15 — 50% of all quasars should show a TVjji > 
10 19 cm -2 absorber within Sv < 3000 km s" 1 , which is cer- 
tainly not observed. Finally, another goal of our survey is 
to identify candidates for follow-up observations to study 
Lya emission from PDLAs which could possibly yield in- 

6 At z = 3, Sv = 3000 km s — 1 corresponds to roughly 30 comoving 
Mpc h _1 assuming Q m = 0.3, Qjy = 0.7 and Hq = 70kms Mpc" 1 . 



sights into fluorescent Lya recombination radiation from 
DLAs, the size of DLA galaxies, and the physics of Lya 
halos surrounding high z quasars (jHennawi et al. I ). 

In the following section, we describe the survey and 
search for PDLA candidates and, in § [3l we discuss their 
Nja measurements. A new estimate of the quasar redshifts 
with PDLA candidates is given in § [U The H I frequency 
distribution and incidence of PDLA systems is presented in 
§ [5] and we discuss these results in terms of clustering and 
the proximity effect in § [5] We summarize and conclude in 
§ [7] Throughout the paper we adopt a ACDM cosmology 
with n m = 0.3, n A = 0.7 and H = 70 km s _1 Mpc" 1 . 



fc Wolfel - SURVEY DESIGN AND PDLA CANDIDATES 



We began with the full SDSS-DR5 sample of -67000 
spectroscopically classified quasars. The SDSS fiber-fed 
spectrometer gives a FWHM resolution of ~ 2A and wave- 
length coverage spanning 3800 to 9200A. Following PHW05, 
we demanded that the quasar spectrum exhibit at least 
one region where the median signal-to-noise (S/N) of 20 
consecutive pixels equals or exceeds 4. This requirement 
eliminates many of the quasars at z > 4 which have poorer 
S/N data. For each QSO, we determine a starting and end- 
ing redshift Zi and Zf which determines the search path of 
that object: 



(Az)j = {zi - z f )j 



(1) 



In the PHW05 analysis, Zi corresponded to the first pixel 
where the data satisfied the S/N requirement under the 
restriction that this not occur within 10000km s" 1 of the 
Ly/3+O VI emission region of the QSCQ. For a search fo- 
cusing on PDLA systems, we will define Zi to be a veloc- 



ity v p 



blueward of z em under a strict restriction: the 



quasars must also satisfy the S/N criterion at a velocity 



greater than (3000km s 1 



c ) from the QSO. We 



adopt this restriction for two reasons: (i) we wish to avoid 
the presence of a DLA system with v « v prox from biasing 
the S/N below our criterion; (ii) as discussed in § we 
wish to maintain a search path (Az)j for each QSO to be 
set exactly by v prox . 

The total redshift search path of the s urvey can be su m- 
marized by its sensitivity function fe.g. lLanzettalfl993ft : 



g(z) = ^2e(z) j 



(2) 



where Q(z)j is unity if z + Sz is included within a given 
quasar's search path (Az)j and zero otherwise. The g(z) 
function for the SDSS-DR5 database is shown in Figure Q] 
for v prox = 3000 and 6000km s _1 (bottom and middle 
curves respectively). For comparison, we show the same 
quantity for the intervening DLA search of PHW05 (cor- 
rected for the error described in footnote [7]). In contrast 
to the intervening g{z) curve, the PDLA g(z) curves show 
significantly more structure. The g(z) curves closely track 
the redshift distribution of SDSS quasars and most of the 

7 Note that PHW05 failed to implement this restriction in their 
published sample. Although the differences to the computed 
statistics of the intervening DLA systems are small, future anal- 
yses by those authors will properly implement the algorithm and 
the authors provide the correct S DSS-DR3 statistical sample at 
|http://www. ucolick.org/ ~xavier/SDSSDLA| The analysis here in- 
cludes the proper statistical sample. 
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Table 1 

SDSS-DR5 QUASAR SAMPLE FOR PDLA 



Plate 


MJD 


FibcrlD 


Name 


qso new 


fl3AL 


% candidate 


750 


52235 


82 


J000009. 38+135618.4 


2.240 







650 


52143 


178 


J000050.60- 102155.8 


2.640 





2.604 


750 


52235 


550 


J000143. 41+152021.4 


2.638 





2.602 


650 


52143 


519 


J000159.12-094712.4 


2.308 





2.276 


387 


51791 


556 


J000221. 11+002149.4 


3.057 





3.017 


650 


52143 


111 


J000238. 41-101149.8 


3.941 







750 


52235 


608 


J000300.34+160027.7 


3.675 





3.629 


650 


52143 


48 


J000303. 34-105150.6 


3.647 


1 


3.600 


750 


52235 


36 


J000335. 21+144743.6 


3.484 







650 


52143 


604 


J000413. 63-085529.5 


2.424 





2.390 



a Quasar redshift reported in SDSS-DR5. 

b New quasar redshift estimated using the algorithms developed by Hcnnawi et al. (2006) and Shcn et al. I <2007l) . This analysis was performed 
on quasars with PDLA candidates lying within 10,000km s _1 of the SDSS-DR5 redshift. 
c 0=No intrinsic absorption; l=Mild intrinsic absorption, included in analysis; 2=Strong intrinsic absorption, excluded. 

^Starting redshift for the PDLA search assuming an offset of v prox = 3000 km s — 1 from the quasar, allowing for the revised redshifts in column 6. 
Note. — [The complete version of this table is in the electronic edition of the Journal. The printed edition contains only a sample.] 
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Fig. 1. — Curves of the redshift selection function g(z) as a 
function of redshift for (top) quasars satisfying the search criterion 
of PHW05 for DLA drawn from the SDSS-DR3; (middle) quasars 
drawn from the SDSS-DR5 for a search within v pr0 x = 6000 of the 
quasar emission redshift z em ; and (bottom) quasars drawn from the 
SDSS-DR5 for a search within v pr ox = 3000 of 2 em . The bin size in 
the figure is Sz = 10 -3 . 



structure i s the result of the SPS S quasar target selection 
algorithm (|Richards et al. Il2004h . On average, the inter- 
vening DLA search path is w 10 x higher than the PDLA 
path for z < 3.5 and « 5x higher at z ~ 4. If the incidence 
of PDLA systems matches that of intervening DLAs, then 
we will expect to find a statistical sample with 5 to 10 x 
fewer systems than PHW05, i.e. as 75 systems. 

Every quasar with an emission redshift z em measured 
to be greater than 2.2 by the SDSS pipeline was searched 
for a PDLA candidate. The search algorithm is nearly 
identical to the algorithm described in PHW05 for in- 
tervening absorbers, only modified to find candidates up 
to 7000km s" 1 redward of (1 + z em ) x 1215.67. The al- 



gorithm keys on any regions in the spectrum where the 
S/N is coherently low over a window of w 25 A. The algo- 
rithm triggers on the cores of damped Lya systems, but 
also on the metal-line absorption features of strong BAL 
systems. Therefore, we removed BAL quasars from the 
search sample; all of the SDSS-DR5 quasars were visually 
inspected and those exhibiting very strong C IV profiles 
at 2 ~ z em were eliminated. Because many PDLA sys- 
tems will exhibit strong C IV absorption, we chose to flag 
only those where the C IV doublet is blended with itself 
(i.e. Avciv > 500 km s^ 1 ) and the rest equivalent width 
W\ exceeds « 3A. We admit, however, that this 'by-eye' 
analysis is somewhat subjective and, in general, we conser- 
vatively included border-line cases under the expectation 
that a true BAL would not show damped Lya absorption. 
All of the quasars comprising the search are listed in Ta- 
ble [IJ 

3. N m MEASUREMENTS 

Our procedure to measure the A^hi values of the PDLA 
candidates follows that of PHW05 and, in fact, we adopt 
their published values for PDLA candidates from the Data 
Release 3. For each new PDLA candidate, we searched for 
the presence of metal-line absorption near the redshift of 
the Lya centroid. Because the absorption redshift coin- 
cides with the quasar emission redshift (z a b s w z em ), the 
strong low-ion transitions of Si II 1206, C II 1334, and 
Al II 1670 generally lay clear of the Lya forest and we 
found that nearly every PDLA system exhibit at least one 
of these transitions. We defined z a bs to be the centroid of 
the strongest, unblended low-ion transition. 

We then fit a Voigt profile to the candidate's Lya profile 
while also estimating the quasar continuum. The latter ef- 
fort is significantly complicated by the quasar's Lya and 
N V emission lines. We guided the continuum placement 
by the observed heights of the Ly/3 and C IV emission fea- 
tures, but the systematic uncertainty related to this exer- 
cise dominates the errors reflected in our measurements of 
AThi. Therefore, while we have reported relatively conser- 
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Fig. 2. — A set of representative fits from our analysis of 
PDLA candidates. The dotted line above the fit traces the pre- 
dicted continuum of the quasar. It is complicated by the pres- 
ence of Lya and N V emission. The fit is shown by the light 
solid line and the estimated la uncertainty is given by the darker 
lines which bracket it. The full set of fits are presented at 
http://www.ucolick.org/~xavier/SDSSDLA 



vative errors for the measurements, we are certain the error 
distribution is non-Gaussian. For example, we expect that 
the probability of an error greater than 3u is not less than 
0.27%. We do believe, however, that rj 95% of the values 
lie within 2a of the reported Nm measurements. 



Table 2 

SDSS-DR5 PROXIMATE DLA STATISTICAL SAMPLE 



Quasar 



Zabs 



f, 



mil— 



log 



J004142.5-085704.6 3.6069 2 

J014049.1-083942.5 3.6966 2 

J014214.7+002324.3 3.3482 2 

J074823.8+332051.2 2.9755 2 

J075901. 2+284703.4 2.8226 2 

J080523.3+214921.1 3.4774 

J080553.0+302937.3 3.4294 2 



20.65 
20.70 
20.40 
20.45 



+U.-2U 



-(1.2(1 
-,+0.15 
-0.15 
-,+0.15 
-0.15 
-+0.15 
-0.15 

21.05± u: l 5 5 
21.65+S-JS 



20.40 



0.15 
+0.15 
0.15 



Table 3 

SDSS-DR5 PROXIMATE DLA 
SAMPLE 



NON-STATISTICAL 



Quasar 






f t> 

J mil 


log JVhi 


J001134.5-, 


-155137.4 


4.3175 


1 


20.501^ 


J001134.5- 


-155137.4 


4.3592 


2 


21 10+ - 20 
zi.iu_ 20 


J014214.7-! 


-002324.3 


3.3482 


2 


20.40l ;i| 


J074823.8- 


-332051.2 


2.9755 


2 


20.45t°;« 


J075901.2n 


-284703.4 


2.8226 


2 


ai.05lg;» 


J080553.0H 


-302937.3 


3.4294 


2 


20.401°;^ 


J081114.3+393633.2 


3.0415 


1 


20.85t°;|° 


J081543.H 


-370037.0 


3.1780 


2 


20.301^° 


J081813.0+263136.9 


4.1610 


2 


21.101°;!* 


J082021.3- 


-390327.2 


4.2838 


2 


20.451°;^ 



a 0=No metals; l=Weak metals; 2=Metals detected 

Note. — [The complete version of this table is in the electronic edition 
of the Journal. The printed edition contains 
only a sample.] 



same spectra). As REB06 noted, the agreement between 
their vales and PHW05 is good. We identify a few cases 
(e.g. the DLA at z = 3.958 toward J111224.18+004630.3), 
however, where we find much lower Ahi values than REB06. 
In these cases, we have verified our lower values by ana- 
lyzing the corresponding Ly/3 profile. 

4. NEW MEASUREMENTS OF QUASAR REDSHIFTS 

In Figure [3^, we present a histogram of the velocity off- 
sets, 



Sv = 



R 2 -l 



(3) 



a 0=No metals; l=Weak metals; 2=Metals detected 

Note. — [The complete version of this table is in the electronic edition 
of the Journal. The printed edition contains 
only a sample.] 

A set of representative fits is shown in Figure [2] One 
notes, in several cases, the challenges of continuum place- 
ment (dotted line) near the Lya emission line. One also 
notes cases where the PDLA Lya profile is significantly 
blueward of the quasars Lya emission line. This is a clear 
indication that determinations of z em based primarily on 
Lya emission will give a poor result. We will return to this 
issue in the following section. The complete set of fits can 



be found at http://www.ucolick.org/^xavier/SDSSDLA 
Finally, we wish to briefly compare the Nm values we de- 
rived with those reported in REB06 (measured from the 



with R = (1 + z a b s )/(l + z ern ) between z a b s for all of the 
fitted PDLA candidates and z em from its corresponding 
quasar as listed in the SDSS-DR5 database. It is evident 
from the figure that the quasar redshifts are systemati- 
cally in error. The presence of very few PDLA candidates 
with Sv « km s _1 and many examples with very nega- 
tive Sv suggests that the Lya absorption profiles of PDLA 
systems have significantly biased the measurement of z em 
away from its correct value. 

It is well known that the primary rest-frame ultraviolet 
quasar emission lines which are redshifted into the optical 
z > 2, can differ by up to ~ 3000 km s -1 from sys- 
temic, due to outflowin g/inflowing material in the broad 
line regions of quasars (Gaskcll 19 82t iTvtler fc Fani ri992; 
IVanden Berk et al l200lHRichards et al. II2002D . The quasar 
redshifts provided by the SDSS spectroscopic pipeline come 
from a max imum likelihood fit to m ultiple emission lines 
(see e.g. IStoughton et al. I [2002) . which does not re- 
sult in robust systemic redshifts estimates. Concerned 
that the errors in z em will significantly affect our clus- 
tering analysis, we estimated the systemic emission red- 
shift of every QSO with a PDLA candidate exhibiting 
N m > 10 20 cm" 2 an d Sv < 10000 km s " 1 , following the 
approach des cribed in Shen et al\ (|2007l see also Hennawi 
et al. 2006). IShen et al. I ( 2007 ) measured the correlation 
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Fig. 3. — (a) Velocity offset Sv between z a i, s of the PDLA candi- 
dates and the quasar emission redshift z em reported in SDSS-DR5. 
Negative velocities indicate z a ba > z em . This analysis demonstrates 
that there are significant systematic errors in the emission redshifts; 
note, especially, that there are few examples with 5v ?s yet many 
PDLA candidates with Sv < —500 km s _1 . These results motivated 
us to re-measure the redshifts of all quasars with a PDLA candi- 
date w ithin 10,000km s —1 us ing the techniques of lHennawi et al7\ 
l|2006T l and IShen et al. I l|2007T l. (b) Same as (a) but adopting new 
quasar redshi fts after employing new algorithms for th e quasar emis- 
sion redshift (Hcnnawi et al. 2006; Shcn et al. 2007). The new Sv 
values have a more physical distribution near 0km s — ■ 



between the relative shifts of the high-ionization emission 
lines Si IV, C IV, C III, and the shift between these re- 
spective lines and the Mg II line. Since the redshift de- 
fined by Mg II is tightly cor relate d with the sy s temic red- 
shift ([Richards et al. 1120021) . the IShen efaU (|2007l ) ap- 
proach exploits these correlations to "shift" into the sys- 
temic frame. The emission lines in the SP SS spectra were 
center ed using the algorithm describe d in iHennawi et al. I 
(2006) which was also employed by IShen et al. I (|2007T ). 
The updated redshifts are presented in column 6 of Ta- 
ble CD 

In the lower panel of Figure [3l we present the velocity 
offsets for all PDLA candidates with Sv < 10000 km s -1 . 
We note a nearly smooth distribution of systems with Sv > 
km s -1 and « 40 PDLA with Sv < km s _1 . Although 
we expect the revised redshifts still suffer from system- 
atic uncertainties, we are optimistic that the uncertainty is 
smaller than a few hundred km s _1 and, more importantly, 
that the errors are symmetrically distributed. One may be 
concerned that we have restricted the new z em measure- 
ments to a subset of the SDSS-DR5 QSO database. As dis- 
cussed below, because we include all PDLA systems with 
negative Sv in our clustering analysis, the error in z em for 



quasars without PDLA candidates should have minimal 
effect on our results. 

5. THE H I FREQUENCY DISTRIBUTION AND ITS 
MOMENTS 

5.1. f(N m ,X) 

Akin to the luminosity function of galaxies, the fun- 
damental measure of a DLA sample is its H I frequency 
distrib ution, f(Nm,X). Here , X is the absorption dis- 
tance (Ba hcall fc Peebleslll969t) defined to give a constant 
comoving distance. Following PHW05, we have calculated 
f(Nm,X) for the PDLA systems. The results are pre- 
sented in Figure Q] and given in Table 2] We have fit- 
ted the frequency distribution with a single power-law 
f(N m , X) = k\N ai and a T-function 

f(N m ,X) = fe(^) a2 exp(. 

lihood solutions are overplotted in the Figure. The power- 
law has exponent ot\ — —2.1 ±0.1 while the T-function has 
a similar power-law slope and allows any value log A 7 > 
21.6 at the 95% cl. We conclude that the PDLA sam- 
ple does not show a break from a single power-law model. 
For comparison, we also show the best-fit T-function for 
/(Am, A) of intervening DLA from PHW05 renormalizcd 
to give the same number of PDLA systems as observed. 
The shaded region shows the approximate la uncertainty 
in the shape of this T-function. 
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Fig. 4. — H I frequency distribution f{Nm,X) for the PDLA 
systems at z > 2.2 drawn from SDSS-DR5 (binned data points). 
Overplotted on the data are three curves corresponding to the solu- 
tions from maximum likelihood analyses: (i) a single power-law fit to 
the PDLA data; (ii) a T-function fit to the PDLA data; and (iii) the 
T-function fit (with lcr uncertainty indicated by the green band) to 
the SDSS-DR3 sample of intervening DLA systems (PHW05) renor- 
malizcd to match the incidence of PDLA systems. At low iVm, 
the intervening DLA and PDLA systems exhibit roughly the same 



power-law dependence, f(Nu-[,X) oc N B 



In contrast with the in- 



tervening DLA, the PDLA systems do not show an obvious break in 
the power-law at large Nm values (> 10 21,5 cm -2 ). However, the 
difference in f{Nm,X) at large TVhi between the intervening DLA 
(SDSS-DR3) and the PDLA systems is only significant at tn 95% 
confidence level. 



The figure suggests that the Nm values of the PDLA 
sample are not drawn from the same parent population as 
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Table 4 
FITS TO f(N m ,X) 



Form 


Parameters 


ze|2.2,5.5) 


2<e[2.2,3.0) 


ze[3.0,5.5) 


Single 
Gamma" 


log fei 

ai 
log k 2 
log N~, 

Q-2 


20.30^ 
-2.041°;?? 
-24 S6+ 04 

> 21.7 
— 1.85i ; 12 


zo.oi_ 08 
9 91 +0.19 
z * zl -0.24 

ZJ - 1J -0 .08 
> 21.5 

-2 09+ 021 
z - uy -0.13 


19.28+^ 

-i.99tS:}° 

— 9zL Ozl+0-0 5 

> 21.6 

x - ' D -0.14 



a The 7V 7 parameter reported is a 95%c.l. lower limit. 

Note. — The errors reported are one-parameter errors which do not account for correlations among the parameters. 



intervening DLA systems. We find, however, that a two- 
sided Kolmogorov-Smirnov (KS) test gives a probability of 
30% for the null hypothesis. This rather high probability 
may be misleading, however, because the standard KS- 
test focuses on the median of the two distributions which 
is dominated by the lower Am values. From the Figure, it 
is evident that the two distributions only differ in the high 
Nni tail of the samples. Integrating the re- normalized T- 
function for the intervening DLA from Am = 10 215 cm~ 2 
to infinity, we predict 2.1 absorbers compared against the 
6 that are observed with log Am > 21.5. Even if we ig- 
nore the uncertainty in the predicted incidence, the Pois- 
son error on six absorbers implies they differ at only 2a 
significance (i.e. 95% c.l.). Therefore, we contend that 
the figure is suggestive but there is no conclusive evidence 
for a difference in the shape of /(Am, A) for the proxi- 
mate and intervening DLA systems. We also note that 
there is no statistically significant difference in /(Am, A) 
if we compare two bins of PDLA systems split at z = 3 
(Table gl). 

5.2. THE INCIDENCE OF PDLA SYSTEMS (£ PD la) 

The zeroth moment of the H I frequency distribution 
gives the incidence of absorbers along the sightline. Au- 
thors traditionally express this quantity per unit redshift, 
£(z). Of greater interest to the nature of the DLA systems 
is the incidence per unit absorption distance, £(X). We 
will consider both quantities. It is standard practice to 
discretely estimate l{z) with a calculation evaluated over 
a redshift interval [z min , z max \, 



£{z) 



mp DL A 



(4) 



where m pd la is the number of DLA systems within the 
redshift interval. 

For our evaluation of £{z) and £(X) we include a subtle 
but important modification. Although we have defined the 
search path of each quasar to be from Sv = km s _1 to 

we 



(Equations ll^p for quasars with , 



'•'mini ^rnax J i 



u prox 

also include all of the absorbers with Sv < km s _1 in our 
calculation of mpDLA- That is, we increment mpoLA by 
one for each damped Lya system with Sv < km s _1 even 
though we have defined the search path (Az)j to have Sv > 
km s _1 . At first glance, it appears that we will overes- 
timate £(z). We have adopted this formalism, however, 



N 

^< 
-J 
Q 



X 




Fig. 5. — Incidence of PDLA systems as a function of redshift per 
(a) unit redshift and (b) comoving distance. The dashed curve traces 
the incidence of intervening DLA systems (PHW05; Equation [5} . 
The enhancement in the incidence of PDLA systems at z = [2.5, 3.5] 
is significant at > 99% c.l. assuming Poisson statistics. 



to account for the relatively large systematic uncertainties 
in the quasar redshifts ( § HJ) - If these errors are roughly 
symmetric then we will have as many PDLA shifted out- 
side the sample as shifted in at the boundary v = v prox . 
Similarly, for any given quasar we will be searching a little 
more or less redshift path than 3000 km s _1 depending on 
the sense of the quasar redshift error, but the total search 
pathlength will be approximately correct. At the same 
time, however, we must include all PDLA systems with 
Sv < km s _1 to accurately assess £(z). 

Figure [5] presents (a) £{z) and (b) £(X) as a function of 
redshift for the PDLA systems. The curves in each panel 
correspond to 

< £{z) >= 0.6exp(-7/z 2 ) (5) 
which is a good representation^ of < £{z) > for the inter- 

8 The function is not properly normalized at z = but one could 
add a small offset to achieve this. 
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Table 5 
SUMMARY OF RESULTS 



z 


"IPDLA 


Az 


<pdla(2) 


AX 


^PDLAPO 


f^ ULA (xlO" 3 ) 


[2.2,2.5] 
[2.5,3.0] 
[3.0,3.5] 
[3.5,5.0] 


8 

22 
49 
28 


48.5 
59.6 
72.0 
58.6 


u.±oo_ 057 
n 369+ 097 

n 47S+ - 108 

0.478_ 090 


158.1 
206.6 
265.5 
235.2 


n 051 + UM ~ zi > 
u.uoi_ 018 

107+ - 028 
U-J-U' -0.022 

n iHK+0-030 

U.l(5O_ 02g 

n H9+0.027 
u.xj.»_q 022 


U.OD_ Q 23 
i o 9 +0.46 
1 - OZ -0.57 
3 03+ ' 74 
1 qo+067 



vening DLA systems at z > 2. In fact, a power-law repre- 
senta tion of the form £{z) oc (1+z) 7 (e.g. lStorrie-Lombardi fc 
2000) is no longer a good description of the observations 
at high z. 

The two panels show similar results. One observes an 
enhancement in the incidence of PDLA systems for z = 3 
to 3.5 and that the incidence of PDLA is consistent with 
the intervening values at other redshifts. Because the rep- 
resentation of £ given by Figure [5] may be sensitive to the 
binning, we also compare the proximate and intervening 
DLA systems as a function of redshift in histogram form 
(Figure [5]). Here, we have histogrammed the redshifts of 
the PDLA systems in the full sample and overplotted the 
predicted number calculated by convolving the redshift 
search path g(z) with the incidence of intervening DLAs 
(Equation [5]) . 

If we assume Poisson statistics, we find that the 95% c.l. 
intervals of the z = [2.5, 3] and z = [3, 3.5] do not overlap 
for the PDLA and intervening systems. Similarly, we find 
that the 99% c.l. intervals on I for z = [2.5,3.5] do not 
overlap. We conclude, therefore, that the PDLA systems 
exhibit a higher incidence than the intervening DLA in the 
redshift range z = [2.5,3.5] but are otherwise consistent 
with the cosmic average. We will attempt to interpret this 
rather unusual signal in the following section. 



30 



300 km s 1 < Sv < 6000 km s 1 , which even exceeded 
Wddfe.t for PDLAs with Sv < 3000 km s" 1 . Such an en- 
hancement is very unlikely to be related to absorber clus- 
tering near quasars. At z ss 3, Sv = 3000 km s _1 corre- 
sponds to rs 30Mpc/i _1 . Even fo r the strong high re dshift 
quasar clustering measured by IShen et al. I (|2007f) (e.g. 
£(r) = (r/r )" 7 with r ~ UMpch' 1 and 7 = 2 at z > 3) 
a relatively small enhancement is expected at such large 
distances from the quasar. In Figure [7] we present £(z) for 
3000 km s- 1 <Sv < 6000 km s" 1 adopting (a) the SDSS- 
DR5 quasar redshifts and (b) our revised values. Although 
the latter shows a mild enhancement in £(z) at z k, 3, the 
effect is significantly smaller than REB06. Adopting the 
revised redshifts, furthermore, we derive £(z) values con- 
sistent with the cosmic average. We suspect, therefore, 
that the REB06 result for v prox — 6000 km s^ 1 was pre- 
dominantly the result of spurious redshifts, although they 
reached qualitatively similar conclusions as our own when 
restricting to Sv < 3000 km s _1 . 
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Fig. 6. — Histogram of the absorption redshifts for the full PDLA 
sample. The dashed curve shows the predicted number of systems 
assuming the incidence of intervening DLA systems (Equation |5)l 
convolved with the PDLA selection function (Figure [TJ. 



A puzzling result from REB06 was that the authors 
reported an enhancement in £(z) for DLA systems with 
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(a) SDSS-DR5 (6000 km/s > 5v > 3000 km/s) 



(b) Revised z em (6000 km/s > 5v > 3000 km/s) 
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Fig. 7. — Incidence of DLA systems with 6000 km s _1 > Sv > 
3000 km s — 1 as (a) determined using z em values taken from the 
SDSS DR5 data release. Similar to REB06, we note an enhancement 
in ^(X)pdla at z ~ 3. After recalculating z em for all quasars 
with DLA candidates within 10,000km s _1 , we derive the results 
shown in panel (b). The incidence of DLA in this velocity interval 
is consistent (Xn u = 1-04 assuming four degrees of freedom) with 
the expected rate assuming the cosmic average with no error in this 
value. 



REB06 also reported an enhancement in PDLA of a fac- 
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tor £(z) = 1.4 < ^(z) > with » 2a significance at a mean 
redshift < z a bs >— 3.36. Our results indicate a larger en- 
hancement and much higher statistical significance at this 
redshift. Again, the differences are related to our larger 
sample size and errors in the z em values reported by the 
SPS S. We also note that our results more resemble those 
from lEllison et al. I ((2002) who had more precisely mea- 
sured QSO redshifts given the lower median redshift of 
their sample. 
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Fig. 8. — Mass density in neutral gas relative to the critical 
density for intervening DLA (light green points; PHW05) and the 
PDLAs (black). Similar to the results on the incidence of PDLA, we 
observe an enhancement in r2^ DLA at z ~ 3 yet results consistent 
with the cosmic average at other redshifts. 



5.3. THE H I MASS DENSITY WITHIN w 30/1" 1 Mpc 
OF QUASARS (^ ULA ) 

The first moment of the H I frequency distribution yields 
the mass density in H I atoms: 



Q g (X)dX 



prn H H 
cp c 



Nf(N m ,X)dX 



(6) 



where p is the mean particle mass per mn of the gas 
(taken to be 1.3), H is Hubble's constant, and p c is the 
critical mass density. B ecause the damped Lya systems 
dominate this integr al (jProchaska fc Herbert-Fort] 12004 
lO'Meara et al. 112007ft and because they are predominantly 
neutral, an evaluation of Equation[6]from N m i n = 2 x 10 20 cm' 
to infinity gives an accurate estimate of the mass density 
of neutral gas. For this analysis, we have focused on prox- 
imate DLA systems with 5v < 3000 km s _1 roughly cor- 
respondingly to a comoving 30 Mpc h~ l . We will refer 



to this quantity as ft 



PDLA 



the average mass density of 



neutral gas measured along an w 30Mpc h~ x path toward 
quasars with a median z-magnitude of 19.1 mag. 

Following standard practice (e.g. PHW05), we estimate 
j-jPDLA w jt n a discrete evaluation: 



qPDLA 



pm H H N m 



CPc 



AX 



(7) 



where the sum is performed over the Am measurements 
of the damped Lya systems in a given redshift interval 
with survey pathlength AX. The results are presented 
in Figure [5] and compared against the mass density for 
intervening damped Lya systems, ri^ LA , determined from 
the DR3 sample (PHW05). Similar to the zeroth moment 
of /(A/hi, X), we find that ftP DLA is consistent with fJ^LA 
at z < 3 but exceeds fl^ hA at z w 3.5. Although the 
uncertainty in Sl^ DLA is large, the enhancement over 17^ LA 
is significant at greater than 95% c.l. in the z — 3 to 3.5 
interval. 

There are two additional points to emphasize with re- 
gards to our estimation of il™ LA . First, we do not have a 
large enough sample of proximate DLAs to determine the 
inevitable break in the a « —2 power-law for /(Am, A) 
observed for Ahi < 10 213 cm" 2 . Therefore, our evaluation 
of r^ DLA is formally a lower limit at all redshifts. Sec- 
ond, as noted above, we are measuring DLA along biased 
sightlines. If bright quasars photoionize significant quan- 
tities of neutral hydrogen gas and anisotropically, we will 
underestimate f2^ DLA if magnitude- limited quasar surveys 
are biased to these viewing angles. We suspect that the 
first effect (sample size) is a less than factor of two effect 
while we will consider simple models of the latter effect in 
the following section. 

We have performed the following, simple calcluation in 
an attempt to estimate a lower limit to the dark mat- 
ter mass of halos hosting quasars. At z — 3, we have 
calculated the dark matter density within 30 Mpc /i -1 of 
the most massive dark matter halos in a cosmological box 
600 Mpc /i -1 on a side. The mean value never exceeds 
1.5 times the mean density at this epoch. We reach the 
obvious conclusion that the H I mass density must in- 
crease non-linear ly (e.g. n 2 H ) with dark matter overden- 
sity. Therefore, a direct comparison with our observations 
will require a full hydronamical simulation that includes 
an extragalactic background radiation field and radiative 
transfer. 

6. DISCUSSION 

In this section we physically interpret the incidence of 
PDLA systems measured in § 15.21 by considering two com- 
plimentary measurements: the clustering of optically thick 

absorbers around quasars in the transverse direction, mea- 

sured from close projected pairs of quasars (jHennawi fc Prochaskal 
2006) , and the strength and evolution o f the auto-cor relation 
function of high-redshift quasars (|Shen et al. Il2007t) . Be- 
cause PDLAs can be detected in absorption against bright 
background quasars out to z ~ 5 their statistics probe the 
^environments of quasars to m uch higher redshiftfl than 
galaxy cross-corr elation studies (|Adelberger fc Steidelll200"5l : 
ICoil et 0/71120071 ). However, the large ionizing flux emitted 

by a quasar can ionize the neutral gas in n earby galaxies, 

thus reducing their absorption cross-section ( Hcn nawi fc Prochaskal 
2006) . The interpretation of PDLA incidence as a measure 
of quasar-galaxy clustering is thus complicated by ioniza- 
tion effects. Nevertheless, the physical problem of a quasar 
illuminating a self-shielding optically thick absorber is very 
rich and can teach us much about both quasars and high- 

9 This redshift limit is set by the small number of quasars with 
z > 5. 
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Fig. 9. — Comparison of the evolution of the overdensity of PDLAs 
near quasars (1 + X| ) ( see eqn.(9)l to those implied by other measure- 
ments. The (green) data points show our measurements of the line- 
of-sight correlation function from the abundance of PDLAs, where 
the horizontal error bars indicate the size of the redshift bins used. 
The solid curve shows the line -of-sight correlatio n function predic- 
tion obta ined by combining thelShen et al. I 1120071) clustering results 
with the |H ennawi & Prochaska (2006) transverse quasar-absorber 
clustering measurement for 7 = 2. The upper (red) striped region 
illustrates 1 — a error from combining thes e results. The dashed 
curve shows the same quantity, but using the Hcnnawi & Prochaska 



(2006) measurement with a shallower slope of 7 = 1.6 (the 1 
error region is comparable to the red striped region but not shown 
for clarity). The dotted curve and lower (blue) striped region are 
the line-of-sight correlati on function and c orresponding 1 — a er- 
ror when we combine the IShen et al. d2007l) cluste r ing re sults with 
DLA auto-correlation implied by the [Cooke et aLj (120061 ) measure- 
ments. The (red-blue) cross-hatching indicates the overlap of the 
error regio ns, and illustrates th at the (Hcnnawi & Prochaskal l2006l ) 
results and lCooke et al~\ (120061 ) measurement are consistent within 
the errors. 



Following lHennawi fc Prochaskal(|2006f ), we can describe 
the increase or decrease in the line density, £(z), of proxi- 
mate DLAs at a velocity separation v from a quasar with 
the 'line-of-sight' (LOS) correlation function 

£(z,v,Av) = (e(z))[l + X \\(v,Av)}, (8) 

where (£(z)) represents the cosmic average line density 
(PHW05) and Av indicates the width of the velocity in- 
terval searched. The LOS correlation function is given 
by an average of the quasar-absorber correlation function, 
£qso-dla(7), over a cylinder with cross-section A equal 
to the DLA cross-section and extent 2Av/aH(z) along the 
line-of-sight direction. Here a is the scale factor and H (z) 
is the Hubble constant at redshift z. For velocity separa- 
tions v close to the quasar, %|| depends on the size of the 
absorber cross-section, which is unknown. In what follows, 
we assume a plausible radius of r a bs = V ' A/it = 20 kpc. 
Increasing r a b s by a factor of two results in a factor of 
two d ecrease in x\\ (see Fig ure 2 of 1 ennawi fe Prochaskal 
1200611 . 

Usin g projected pairs of quasars. iHennawi fe Prochaskal 
( 2006|) measured the cross-correlation between foreground 
quasars at z ~ 2.5 and optically thick iVm > 10 19 cm 2 
absorbers, detected in the background quasar spectra. As- 



suming a power law shape for the quasar-absorber cross- 
correlation function, they measured ro = 9.21^'y h^ 1 Mpc 
for 7 = 1.6, or r = 5.8±ls hr 1 Mpc for 7 = 2. This mea- 
surement of the transverse clustering of absorbers around 
quasars should be commensurate with the line-of-sight clus- 
tering probed by the incidence of PDLA systems, provided 
that the clustering pattern of absorbers around qu asars is 
isotropic. However, IHennawi fc Prochaskal (|2006h argued 
that the strength of the transverse clustering predicts that 
~ 15 — 50% of all quasars should show a Nm > 10 19 cm 2 
absorber within Av < 3000 km s _1 , which is not observed 
(jO'Meara et al. II2007D . Thus, the tr ansverse clustering 
overpredicts the number of absorbers along the line-of- 
sight by a large factor, or equivalently the clustering pat- 
tern is highly anisotropic. The most plausible physical 
explanation is that the transverse direction is less likely 
to be illuminated by ionizing photons than the line-of- 
sight, either due to anisotropic emission, or variability of 
the quasar emission on timescales short compared to the 
transverse light crossing time (~ 5 x 10 5 yr). We revisit 
this issue of anisotropic clustering with the statistics of 

P DLAs below 

IShen et al. I (|2007t ) recently quantified the clustering of 
high-rcdshift (z > 2.9) quasars and measured a comov- 
ing auto-correlation length of ro = 16.9 ± 1.7 h^ 1 Mpc for 
quasars in the redshift range 2.9 < z < 3.5, and an even 
stronger ro = 24.3 ± 2.4 hr 1 Mpc for z > 3.5 quasars. 
High redshift quasars are thus much more strongly clus- 
tered than their z ~ 2 counterparts, which have rn ~ 
7.5 h~ x Mpc (see e.g. iPorciani, Magliocchetti fc Nprbera 
120041: ICroom et al. Il2005t IPorciani fc Norbergj|2006l ). and 
the evolution of the quasar clustering strength with red- 
shift is extremely rapid. Naively, we expect this to drive 
a rapid evolution in the cross-correlation of quasars with 
DLAs. 

Our strategy for comparing the PDLA clustering to these 
complimentary measurements is to normalize t he quasar- 



DLA c ross-correlation strength to the transverse lHennawi fc Procha 
(2006) result at z ~ 2.5, and assume that the clustering 
of DLAs does not evolve with red shift, but that the quasar 
clustering evolves as measured bv lShen et al. I (|2007l ) . Specif- 
ically, we use the relation 



£QSO-DLA(r, z) = y £QSO-QSo(f, z)£DLA-DLA(r); ( 9 ) 

to determine the parameters of the £dla-dla and prop- 
agate e rrors according to those quoted by both measure- 
ments. IShen et a/~l (|2007t ) measured the auto-correlation 
function of quasars in two redshift bins 2.9 < z < 3.5 
and z > 3.5, very similar to the two high redshift bins in 
this work. Quasar clustering has not yet been measured 
in the redshift r ange 2.1 < z < 2.9, so we linearly interpo- 
late between the lShen et al. (12007ft result and the redshift 
evolution measured by IPorciani fc Norberd (|2006TR 

A few limitations to our approach should be noted. First, 
by assuming the DLA clustering does not evolve, we might 
overestimate (underestimate) the strength of the quasar- 
DLA cross-correlation function if DLA-g alaxies are less 
(more ) clustered at high redshift. However. I Adelberger et aT~\ 
(2005) found that the clustering of LBGs is very nearly 

10 A small correction is applied to account for the fact that Shcn 
et al. used a different power law slope (7 = 2) than Porciani & 
Nordberg (7 = 1.8). 
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constant in the range r$ ~ 4 — 5 h~ x Mpc over th e redshift 
range 1.4 < z < 3.5, and lOuchi et al\ (|2004h similarly 
measured the correlation lengths of 3.5 < z < 5.2 LBGs 
and found a correlation length of vq ~ 5 Mpc which 
was nearly constant with redshift. In light of evidence that 
DLA-galaxies are drawn from a similar (but fai nter) galaxy 
popu l ation as spect roscopically selected LBGs (jMoller et al. I 
2002; Schayc 2001), it is reasonable to assume th at DLA 
cluste ring does not evolve significantly. Second, the lHennawi fc 
(2006) measurement was for all absorbers with Ahi > 
10 19 cm 2 , so by applying that measurement to the DLAs 
(Nm > 2 x 10 20 cm -2 ) we implicitly assume that the 
clustering of absorbers is independe nt of column densi t'sPI. 
Final ly, it is conceivable that the ([Hennawi fe Proch aska 
l2006h cross-correlation strength is over-estimated because 
of linc-blcnding of systems near the column density thresh- 
old (iVm > 10 19 cm -2 ), which results in a 'Malmquist'- 
type bias because the li ne density £(z) of abs orbers is a 
relatively steep function (|Q'Meara et al. Il2007t ) of column 
density limit. 

To address these issues we explore another avenue for es- 
timating the autoco rrelation of £dla-dla m eqn.[51 Namely, 
I Cooke et al. I (|2006l ) used photometrically selected LBGs 
in the vicinity of DLAs to measure the cross-correlation 
between LBGs and DLAs at z ~ 3. Assuming 7 = 1.6 
they measured tq — 2.93lt 15 for the DLA-LBG cross- 
correlation, and ro = 3.32±0.6 for the LBG auto-correlation. 

From £dla-dla = \/£dla-lbg/£lbg-lbg, we deduce 
the auto-correlation of DLAs to be ro = 2.6 ±2.7 h^ 1 Mpc 
(7 = 1.6), which can then be used in eqn.[5J again assum- 
ing that that the DLA clustering does not evolve with 
redshift. 

Our measurement of the evolution of the overdensity of 
PDLAs near quasars (1 + X||) is compared to the implica- 
tions of these other measurements in Figure^ The (green) 
data points illustrate our measurement of the line-of-sight 
correlation function from the abundance of PDLAs (see 
eqn. [8]). The solid curve shows the line-of-si ght correla- 
tion fu nction prediction when we c ombine thelShen et al. 
2007 ) clustering results with the iHennawi fe Prochaska 
2006h quasar-absorber clustering measurement for 7 = 2. 



quasars and lack of evolution, measured here. This dis- 
agreement provides compelling evidence for the hypothe- 
sis that the ionizing flux from quasars photoevaporates H I 
in nearby DLA-galaxi es, thus reducing their c ross-s ection 
for DLA absorption. IHennawi fc P rochaskal ( 20061 ) used 
a simple photoevaporation model to show that optically 
thick absorbers with tih < 0.1cm -3 will be photoevapo- 
rated if th ey lie within ~ 1 Mpc of a luminous quasar, so 
Brochaaliis arc sensible on physical grounds. 



The upper (red) striped region illustrates the implied la 
error from combining these results. The dashed curve 
shows the same quantity, but using the[H ennawi fc Prochaskal 
(2006) measurement with a shallower slope of 7 = 1.6 (the 
la error region is comparable to the red striped region but 
not shown for clarity). The dotted curve and lower (blue) 
striped region are the line-of-sight correlation function and 
corres ponding la error when we combine the IShen et al. I 
( 2007J) clus tering results with DLA auto-correlation im- 
plied by the lCooke et al. I (|2006l ) measurements. The (red- 
blue) cross-hatching indicates the overlap of the error re- 
gions, and i llustrates that the (IHennawi fc Prochaskar2 006) 
results and ICooke et al \ (|2006F measurement are consis- 
tent within the errors. 

A discrepancy is apparent between the predictions in 
Figure O which are driven by the strength of the quasar 
auto-correlation function and its rapid redshift evolution, 
and the relatively modest clustering of PDLAs around 

11 This is a reasonable assumption if absorbers with TVhi ~ 
10 19 cm -2 ar ise from material at larger impact parameters from DLA 
systems (e.g. IMaller et al. ||2003|) . 



The decrease in the number of optically thin Lya for- 
est absorption lines in the vicinity of quasars, known as 
the proximity effect, has been detected and is well stud- 
ied i Baitlik, Duncan fc Ostrikerlll988t IScott et al. 1 120001: 
iFaucher-Giguere et al. 1120071) . The factor of ~ 5 - 10 dis- 
crepancy between the predicted clustering results and the 
data in Figure O provides compelling evidence that a sim- 
ilar proximity effect exists for opti cally thick absorption 
line sy stems. However, according to lHennawi fc Prochaskal 
(2006), there is no transverse proximity effect for optically 
thick absorbers, which gains further credibility in light of 
the null detections of the transverse effect in the (opti- 
cally thin) Lya forest (ICrottsl[l9 89; Dob rzvcki fc Bechtolc 



1 19911: iFernandez-Soto et al. 1 119951: iLiske fc Willige 
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TCroftl 



2001: 
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ISchirber. Miralda-Escude fc McDonald! 12004 
but see Worseck & Wisotzki (2006)), although these trans- 
verse studies are all based on only a handful of projected 
pairs. Both the optically thin results and our result for 
optically thick systems can be explained if the transverse 
direction is less likely to be illuminated by ionizing pho- 
tons than the line-of-sight, either because the emission is 
anisotropic, or because the foreground quasar varies on 
timescales short compar ed to the transverse light cro ssing 
time (~5x 10 5 yr) (see IHennawi fc Pro chaska 2006, for a 
detailed discussion of both) . 
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Fig. 10. — Toy photoevaporation model predictions for the evolu- 
tion of the overdensity of PDLAs near quasars. The solid, dashed, 
and dotted curves represent the same measurements shown in Fig- 
ure [9] but allowing for a reduction in the clustering signal because 
optically thick absorbers are photoevaporated by the large ionizing 
flux if they are close enough to the quasar. The volume density of 
neutral hydrogen was assumed to be njj =0.1 cm 3 . 



We use a toy model to illustrate how quasar-absorber 
clustering can be used to constrain the physical properties 
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of DLAs following iHennawi &: Prochaskal ((2006) , who in- 
troduced a photoevaporation criterion for optically thick 
ab sorbers i l lumin ated by quasars, motivated by the work 
of iBertoIdl |l989). This criterion allows us to compute a 
minimum distance from the quasar, as a function of vol- 
ume density, at which a DLA can self-shield against photo- 
evaporation. In this context, the curves in Figure [5] repre- 
sent predictions for the intrinsic clustering of DLAs around 
quasars in the absence of ionization effects. Because prox- 
imate absorbers lie along the line-of-sight, they must be 
exposed to the quasars ionizing flux, and we can calculate 
the reduction in the clustering strength becau se of photo- 
evaporation (see lHennawi fc Procha ska 2006, for details). 
The solid, dashed, and dotted curves in Figure [10] rep- 
resent predictions from the same measurements shown in 
Figure [5] but after taking photoevaporation into account 
using this simple approach. We used a volume density of 
hydrogen of njj = 0.1 cm~ 3 and assumed the quasar had 
an average z-band magnitude i = 19.1, which is the median 
of our PDLA sample. Although crude, this simple model 
illustrates how the density distribution in DLAs can be 
measured by comparing the abundance of PDLAs to the 
intrinsic quasar-DLA clustering, deduced either from the 
transverse quasar-DLA clustering or by combining inde- 
pendent measurements of quasar and DLA clustering. 

7. SUMMARY 

We surveyed the spectra of 5938 SDSS quasars in the 
redshift range 2.2 < z < 5 for proximate DLAs within 
3000 km s _1 of the quasar emission redshift, and presented 
the largest sample (111 systems) of PDLAs uncovered to 
date. Robust systemic redshifts of the quasars hosting 
these PDLAs were computed using a redshift estimator 
which which accounts for the relative shifts between quasar 
emission lines and the systemic frame, and which ignores 
the Lya emission line, which can be entirely absorbed for 
quasars with PDLAs. These improved redshifts allowed 
us to measure the abundance and distribution of PDLAs 
near quasars. The primary conclusions of this study are: 

1. There is suggestive evidence that the Nm values 
of the PDLA population are not drawn from 
the same parent population as intervening DLA 
systems. Specifically, the PDLA sample does not 
show evidence for a break from a single power-law 
model. However, the statistical significance of 
the difference between the PDLA column density 
distribution and the underlying distribution of 
intervening DLAs is only at the 2a level (i.e. 95% 
c.L). 

2. PDLA systems exhibit a higher incidence than the 
intervening DLAs in the redshift range ze[2.5, 3.5], 
but are consistent with the cosmic average at lower 
(ze[2.2, 2.5]) and higher (ze[3.5, 5.0]) redshifts. 

3. The average mass density meaured along an 

w SOMpcft." 1 path toward the quasars we surveyed, 
£lg DLA , is consistent with the cosmic average, 
ft° LA , at z < 3.0 and z > 3.5. In the redshift 
interval ze[3. 0,3.5] an enhancement in ri^ DLA is 
detected at greater than 95% c.l. with central 
value f^ DLA 2 to 3 times ft° LA . 



4. A comparison of the strength of the quasar 
auto-correlation function and its rapid redshift 
evolution, to the relatively modest clustering of 
PDLAs around quasars and a lack of significant 
evolution, provides compelling evidence for the 
hypothesis that the ionizing flux from quasars 
photocvaporates H I in nearby DLA-galaxies, thus 
reducing their cross-section f or DLA absorption 
(|Hennawi fc Proch aska 200l|). 
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